FUSE: Multi-faceted Set Expansion by Coherent Clustering of Skip-Grams

Abstract

Set expansion aims to expand a small set of seed entities into a complete set of relevant entities. Most existing approaches assume the input seed set is unambiguous and completely ignore the multi-faceted semantics of seed entities. As a result, given the seed set {"Canon", "Sony", "Nikon"}, previous models return one mixed set of entities that are either Camera Brands or Japanese Companies. In this paper, we study the task of multi-faceted set expansion, which aims to capture all semantic facets in the seed set and return multiple sets of entities, one for each semantic facet. We propose an unsupervised framework, FUSE, which consists of three major components: (1) a facet discovery module, which identifies all semantic facets of each seed entity by extracting and clustering its skip-grams; (2) a facet fusion module, which discovers shared semantic facets of the entire seed set by an optimization formulation; and (3) an entity expansion module, which expands each semantic facet by utilizing a masked language model with pre-trained BERT models. Extensive experiments demonstrate that FUSE can accurately identify multiple semantic facets of the seed set and generate quality entities for each facet.
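
The skip-gram idea behind the facet discovery step can be illustrated with a minimal sketch. It assumes a skip-gram is a short context window around an entity mention with the mention replaced by a placeholder, and it uses a plain intersection across seeds as a stand-in for the clustering and optimization steps named in the abstract. The toy corpus, the window sizes, and the function names extract_skipgrams and shared_skipgrams are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict

PLACEHOLDER = "__"  # assumed marker standing in for the masked entity mention


def extract_skipgrams(tokens, entity, left=1, right=3):
    """Return one masked context window per mention of `entity` in `tokens`."""
    grams = []
    for i, tok in enumerate(tokens):
        if tok == entity:
            before = tokens[max(0, i - left):i]
            after = tokens[i + 1:i + 1 + right]
            grams.append(" ".join(before + [PLACEHOLDER] + after))
    return grams


def shared_skipgrams(corpus, seeds, left=1, right=3):
    """Keep only the skip-grams observed with every seed entity.

    This intersection is a simplification for illustration; it is not
    the clustering or optimization formulation described in the abstract.
    """
    owners = defaultdict(set)
    for sentence in corpus:
        tokens = sentence.split()
        for seed in seeds:
            for gram in extract_skipgrams(tokens, seed, left, right):
                owners[gram].add(seed)
    return sorted(g for g, ents in owners.items() if ents == set(seeds))


# Toy corpus (made up) in which the seeds share two different contexts.
corpus = [
    "the Canon camera lineup is popular",
    "the Sony camera lineup is new",
    "the Nikon camera lineup is classic",
    "Sony is a Japanese company",
    "Canon is a Japanese company",
    "Nikon is a Japanese company",
    "Sony released a game console",
]
seeds = ["Canon", "Sony", "Nikon"]
shared = shared_skipgrams(corpus, seeds)
print(shared)  # ['__ is a Japanese', 'the __ camera lineup is']
```

In this toy setting the two surviving skip-grams correspond loosely to the "Japanese Companies" and "Camera Brands" facets from the abstract's example, while the context seen only with "Sony" is discarded.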

Similar resources

Modeling Harmony with Skip-Grams

String-based (or viewpoint) models of tonal harmony often struggle with data sparsity in pattern discovery and prediction tasks, particularly when modeling composite events like triads and seventh chords, since the number of distinct n-note combinations in polyphonic textures is potentially enormous. To address this problem, this study examines the efficacy of skip-grams in music research, an a...

Protein classification using modified n-grams and skip-grams.

Motivation: Classification by supervised machine learning greatly facilitates the annotation of protein characteristics from their primary sequence. However, the feature generation step in this process requires detailed knowledge of attributes used to classify the proteins. Lack of this knowledge risks the selection of irrelevant features, resulting in a faulty model. In this study, we introduce...

Ready…set…fuse

The fusion of two lipid membranes underlies a huge range of biological phenomena, from infection by enveloped viruses to the secretion of cellular proteins, but a central aspect of membrane fusion has remained mysterious: do the proteins that mediate fusion act cooperatively or independently? On page 833, Markovic et al. argue that teamwork is the or...

A Unified Learning Framework of Skip-Grams and Global Vectors

Log-bilinear language models such as SkipGram and GloVe have been proven to capture high quality syntactic and semantic relationships between words in a vector space. We revisit the relationship between SkipGram and GloVe models from a machine learning viewpoint, and show that these two methods are easily merged into a unified form. Then, by using the unified form, we extract the factors of the...

Journal

Journal title: Lecture Notes in Computer Science

Year: 2021

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-030-67664-3_37